Evaluation of Endogenous Systems
نویسندگان
چکیده
In system development we are faced with the necessity of evaluation. Evaluation measures our success relative to other: a) theories of development or domains; b) implementations of similar theoretical principles; or c) increments of a given system. For successful software engineering evaluation, progress is measured against a model, a task frequently accomplished through requirements analysis. This model is lacking in natural language processing (NLP) systems (not to be confused with neurolinguistic programming). NLP systems defy evaluation in part because they model an endogenous process – where the whole process is irreducible. Therefore, while specific feature-based evaluations appear reasonable, they fail to capture an overall measure of success. In this paper, we look at the part/whole aspects of evaluation in more detail with regard to one language system type – machine translation. INTRODUCTION In system development, of any type of system, we are faced with the necessary evil of evaluation. Formal evaluation measures the success we are having relative to: a) other theories of development or domain modeling; b) other implementations of the same theoretical principles; c) other systems for the purpose of purchase or funding; or d) previous increments of a given system. For successful evaluation in software engineering, one needs a model against which progress can be measured, a task frequently accomplished through requirements analysis. For many domains, this is a straight-forward process: a correct answer exists, such as, “Pushing F10 results in program termination.” Many types of NLP systems, however, have defied rigorous evaluation because of this lack of defining criteria or requirements. Part of the difficulty lies in the fact that NLP systems are modeling an endogenous process – where the whole of the process is irreducible, context-dependent and lacking a unique right answer. Therefore, while specific feature-based evaluations appear reasonable, they fail to capture an overall measure of success. In this paper, we look at the part/whole aspects of evaluation in more detail with regard to one language system type – machine translation (MT). This paper starts with a description of the MT process as traditionally developed. It then describes the resulting evaluation strategies that follow from these views of MT. The features of MT which categorize it as an endogenous are presented as a precursor to showing a new view of the original MT vision. Finally, the questions of evaluation are explored within this new view.
منابع مشابه
Evaluation of an Integrated Cryogenic Natural Gas Process with the Aid of Advanced Exergy and Exergoeconomic Analyses
In this study, an integrated structure of the air separation unit, natural gas liquids recovery equipped with nitrogen removal unit is developed. In this regard, advanced exergy and exergoeconomic analyses are used to examine the irreversibility, possible improvements and the cost of the inefficiencies of the process. The exergy analysis presents information on the origin of the irreversibility...
متن کاملAdvanced Exergy Evaluation of an Integrated Separation Process with Optimized Refrigeration System
Advanced exergy analysis is a tool to split the exergy destruction of the system to achieve a better perspective about the potentials of a system for improvements. In addition, the component interactions and their exergy destruction dependency with the other equipment are investigated through the advanced exergy analysis. For this purpose, it divides the exergy destruction calculated by convent...
متن کاملA Review on the Evaluation Methods of Health Information Systems
Evaluation of health information systems is an effective approach to ensure the efficacy of the system which can lead to the improvement in quality of health care services. The main aims of evaluation are to identify the strength and efficiency of system in health care delivery, to identify the weaknesses of the system, and to suggest general recommendations for improving the performance of the...
متن کاملEvaluation of recommender systems: A multi-criteria decision making approach
The evaluation and selection of recommender systems is a difficult decision making process. This difficulty is partially due to the large diversity of published evaluation criteria in addition to lack of standardized methods of evaluation. As such, a systematic methodology is needed that explicitly considers multiple, possibly conflicting metrics and assists decision makers to evaluate and find...
متن کاملA short overview of the electrical machines control based on Flatness-technique
Optimal linear controllers and high computational non-linear controllers are normally applied to control the nonlinear systems. Flatness control method is a control technique for linear systems as well as nonlinear systems by static and dynamic feedback namely as endogenous dynamic feedback. This method takes into account the non-linear behavior of the process while preventing complicated compu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002